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ABSTRACT 

An axparimantal study invastigating tha in tar act ion 
of linguistic complaxity and parformanca in child languaga 
acquisition tasts tha hypothasis that childran laarning a first 
languaga acquira ralativaly complax santances sonawhat la tar than 
lass complax santancas. In ona of thraa tasts, tha subjacts, 44 
childran agad 3.6 to 6 yaars, wara prasantad with a sarias of 
structurally divarsa santancas about toys within raach. Tha 
childran *s comprahansion of tha santancas was obsarvad in thair 
reactions as maasurad hy thair movamants of tha toys. Tha sacond and 
third tasts usad two iuiitation tasks focusing on advarbial 
structures, using tha sama toy-moving santancas, varying in langth, 
and toy-moving santancas carafully controllad for langth, to find tha 
subjects* critical sentence length for imitation. Analysis of the 
various responses to sentence types and complexity in all thre«t tasts 
supported the *'waak complexity** hypothasis and suggest that tho 
linguistic factors affecting parformanca are task'-^dapendent and deny 
a simple ona-to-one relationship between complexity and performance. 
(MSB) 
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This ^experimental study investigata'^aj the interaction of linguistic 
comDlexity and performance in child language acquisition. Although the 
role of linguistic complexity in performance is not well understood, two 
quite plausible assumptions about it are often made. It is often assumed 
that linguistically complex sentences involve more effort in production and 
understanding than do less complex sentences; and that children learning a 
first language acquire relatively complex sentences somewhat later than 
less complex sentences . The study reported here was designed to test the 
second of these ideas. It is based on a linguistic theory of complexity 
developed by the first author (Smith, forth- coming). 
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In their strongest form, the plausible assumptions in question make a 
rather simple hypothesis about the relation between linguistic complexity 
and performance - namely, that the one affects the other in a consistent 
manner that does not vary with the type of performance. One might expect a 
weaker, less simple relation. It seems likely that certain factors of 
complexity affect performance more than others; and that not all types of 
performance are affected in the same way by linguistic complexity. We will 
refer to these as the strong and weak forms of the Complexity Hypothesis. 



In this study young children were presented with sentences that varied 



other things, particularly cognitive things, being equal. 
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in linguistic complexity using two tasks, toy moving and imitation. These 
tasks make quite different demands of a subject. If the strongest version 
of the Complexity Hypothesis is correct, the effect of linguistic 
complexity would be similar for both tasks; if a weaker version holds, 
different types of complexity might be dominant in the different tasks. 

Our theory of complexity is a modular one, identifying several factors 
that contribute to the linguistic complexity of a sentence. The present 
focus is on syntactic and semantic structure: the factors of systematic 
complexity, surface structure complexity, and interpretive complexity are 
taken to be most important^ . These factors are assessed for individual 
sentences, resulting in a complexity profile. The assessment is based on 
the derivation of a sentence in the grammar of the language. The basic 
idea is simply that sentences are complex when their derivation involves a 
relatively long path - in terms of operations, a large number of steps 
- through the grammar of a language. 

The theory is not unrelated to the ill-fated Derivational Theory of 
Complexity (DTC); indeed, it is an attempt to use what was interesting 
about that theory while avoiding the shoals on which it foundered. An 
important different between our theory and the DTC is that we focus on 
surface and semantic structure rather than the rules and intermediate 
structures that were essential to the DTC(cf. Fodor i Garrett 1967). 
Further, our treatment of interpretive complexity adds an important 
dimension. 



^In its present form the theory does not include morphology or phonology, 
but It can very naturally be extended to them 
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highest MP and PP nodes in surface structure.-* 

Interpretive cotnolexitv deals with the difference between surface 
syr tactic and scopal semantic structure, or Logical Form. It arises when 
the two structures are not one-to-one! in such cases interpretive rules are 
involved in the grammatical derivation of the sentence^ , Sentences with 
empty categories in surface structure, that must be interpreted, are 
relatively complex in this sense. Quantifiers and other elements that have 
wide scope also add to interpretive complexity; they involve rulss relating 
surface structure to scopal semantic structure. Interpretive complexity is 
crudely assessed, at this stage of the theory, by the number of 
interpretive rules in the derivation of a sentence; a more sensitive method 
of assessment may be developed later. 

A sentence may be relatively simple at one level, and relatively 
complex at another. Perhaps the roost complex sentences are high on all 
levels. There is a certain trade-off between levels, in that a short 
sentence with a reduced surface structure - that is, a structure that 
involves empty categories - requires inte-pretation. Consider 3 for 
example, which contains an empty category indicated by [e]^ 



^Bounding nodes in Government Binding theory. See Smith, forthcoming, for 
discussion. 



^Assuming a linguistic theory that de«is with semantic interpretation » 
such as Government Binding theory 

^We assume an analysis of this sentence in which before introduces a 
reduced dependent clause. 
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3» John arrived before Mary [e] 

This sentence has few terminal elements: in surface structure 
complexity it is relatively low. The empty category requires 
interpretation, so the -sentence has seme interpretive ccaplexity. 

1.2 Structures Used in the Study 

Three related adverbial structures that aiffer in complexity, and 
whose linguistic properties are quite well understood^ were chosen as the 
focus of our study, in these sentences adverbs introduce full and reduced 
surface clauses, allowing comparison between them. The examples 
illustrate. 

4. John called Mary befo*^e Bill invited Sue 

5. John called Mary before inviting Sue 

6. John called Mary before Sue 

These sentences vary in rpretive complexity: the first requires no 
interpretive rules besides the standard rules for tense and aspect, 
irrelevant here (see Smith, forthcoming, for discussion); the second 
requires that one empty category ?e interpreted; the third requires that 
two empty categories be interpreted. (We can say this much without 
committing ourselves as to how the interpretation is made.) We will refer 
to the different structural types according to the structure of the 
dependent clause: S, Ving, and NP. 



They are discussed at length in Smith 1978. 
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The sentences above also differ in length, of course. In our terms 
they differ in amount (number of terminal nodes). Since they have the same 
density the amount difference is decisive for surface structure complexity. 
In surface complexity the order of the structures is reversed: the full 
clause is roost complex because it is longest, Ving shorter, the N? 
structure least complex in S-structure because it is shortest. They have 
the same systematic complexity. 

The strong version of the complexity hypothesis does nov. differentiate 
between types of complexity, treating them in a blindly additive way. This 
version would predict little or no difference between sentences with 
reduced and full adverbial clauses. The reduced sentences have a higher 
degree of interpretive complexity, which is balanced by their lower amount, 
hence lower surface structure complexity. The NP structures are ambiguous, 
however, which may contribute to their complexity in a way not captured by 
our complexity measure^. 

Weaker versions of the hypothesis would predict different complexity 
effects in the toy-moving and imitation tasks, according to their demands 
on the subject. Very generally, toy-moving focuses on interpretation and 
might be expected to be sensitive to interpretive complexity; imitation 
requires production of a surface structure and might be sensitive to 
surface complexity. 

Section 2: The Toy -Moving Experiment 



The question of ambiguity is discussed below, and in Appendix C. 
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Toy-moving tasks are frequently used in studies of linguistic 
eompetenee, because they show how subjects understand the basic roles 
talked about in a sentence^^. Toy-moving is most useful with structures 
requiring interpretation to fill these roles, e»g. control structures or 
sentences with pronouns (ef the work of Solan 1978, Carol Chomsky 1969, " 
etc). Toy-moving tasks have an important limitation: they do not indicate 
the grammatical structure that a child ascribes to a sentence. This is 
clear from the extensive literature on .«,entences with relative clauses, in 
which different researchers give different interpretations to identical 
toy-moving responses. It is claimed that children ascribe to the same 
examples conjunction structure (Tavakolian 198I), extraposed relative 
clause structure (Sheldon 1974), and differently attached relative clauses 
(Solan & Roeper 1978). For the structures we are working with, toy-moving 
tasks are quite informative. Both Ving and NP dependent sentences have 
phonetically empty elements, requiring interpretation of just the type 
mentioned above. Children's reponses in a toy moving task' indicate whether 
they know the adult interpretation to these structures; and what other 
interpretation, if any, they give to them. 

Sentences with adverbials of the S, Ving, and NP type were presented 
to children in a toy-moving experiment. The main variable was the 
structure of the adverbial sentence. A second variable was transitivity, 
of main and dependent clause* The stimulus sentences varied in length due 



^^These roles are the theta roles of GB theory: agent, patient, etc, 
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to differences in both structure and transitivity. ' In terms of the 
theory, they varied in both interpretive and surface structure complexity. 
Tba test sentences thus confound two factors, amount and structure. Note 
that confounding could not have been avoided by holding constant the length 
factor. If we had lengthened the single NP sentences, the lengthening 
material would have made some sentences different from others. Two 
temporal connectives (^llaca and aCian) were used; this variation was 
lexical only and did not affect systematic complexity^ ^ . The specific 
types of sentences are indicated in the table below. This task has been 
used repeatedly in work on temporal connectives, varying the connectives 
and order of main and dependent clause: e.g. Amidon 4 Carey 1972, Barrie- 
Blackley 1973, Clark 1973, Coker 1978, Feagans 1974, French 1977, Keller- 
Cohen 1975, Johnson 1975, and others. The form of the dependent clause has 
not before been varied, however. This body of work gives a good background 
for interpreting our results: in particular with respect to order of 
mention and the different connectives. 

If interpretive complexity strongly affects performance in this task, 
then we expect the NP structures to be most difficult. If amount is 
important, we^fypect NP structures to be least difficult; and we expect 
sentences with transitive clauses to be more difficult than sentences with 



These variables were chosen because of their naturalness in the test 
structures, where different forms of adverbial sentences appeared. The 
complexity factors of density and systematic complexity were held constant 
in order to keep the stimulus sentences to a manageable number. 



Systematic complexity is unaffected because these two connectives have 
the same syntactic constraints. 
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Struetural Variations in Toy-Moving Sentences 

Dependent clause: Sentence 
NVN before/after NVM 
NV before/after NVN 
NVN before/after NV 

Dependent clause: Ving 

NVN before/after Vtng N 
NV before/after Ving N 
NVN before/after Ving 

Dependent clause: NP 
NVN before/after NP 
intransitives. 

The stimulus sentences were presented to children, ranging in age 
from 3.6 to 6. The sentences talked about four animals (cow, horse, 
elephant, tiger) and actions that were easy to act out (jump up, sleep, 
tickle, chase, kiss). Throughout, the same four small toy animals were 
visible and within reach. Sentences with S-strueture adverbials talked 
about three or four animals; the other types talked about two or three 
animals, depending on the transitivity variable. The S and Ving structures 
were tested six times each: thrice with before and thrice with after> The 
NP structures were tested twice with each connective. Two filler sentences 
with the connective j^hm were also presented. The sentences are given in 
Appendix A. The sentences were presented in two blocs. The experimenter 
presented the sentences, and an assistant noted the child's actions* If 



10 



the child asked foe the sentenoe to be repeated, the experimenter coaplied 
without eonnent. 

In order to determine the influence ot age on perfomance, we divided • 
the children into three groups on the basis of age, as follows: 

8. Subjects grouped by age 

Gr'iup 111: 5;0 - 6}0 years; 18 children 

Group II: M;6 - 5;0 years; 11 children 

Group I: 3; 6 - 4; 6 years; 15 children 

2.2 Response categories 

There were five initial response categories: Correct, Reverse, Single 
Clause, and Error. Reverse responses reverse the correct clause order and 
have no other error. For example, to the sentence The cow lumned after the 
horse XisH a Correct response would have the horse fall and then the cow 
jump; a Reverse response would have the oow jump and then the horse fall. 
Reverse responses with additional errors were categorized as Errors; there 
were very few such responses. Single Clause reponses are those of single 
actions, whether or not mentioned in the stimulus sentence. (All sentences 
talked about two actions.) All responses with fewer than two elements from 
the stimulus sentence were considered Failures. 



^The numbers are somewhat uneven. A few children dropped out of the 
experiment. Further, five subject groups were collapsed to three. There 
were originally five age groups consisting of children within each 6-month 
range between 3;6 and 6;0. When the data were analyzed there were no age 
differences between the two oldest and the two youngest groups. They were 
therefore collapsed together for the sake of both statistical power and 
simplicity of presentation. 
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Correct and Reverse Reponses were combined Into a single category » 
Correct. This step was taken because Reverse responses apparently 
indicated only the effect of the after connective and reverse order of 
mention; the experimental variables did not affect ^.he pattern of Reverse 
responses. It is known that order of mention, and temporal connective, 
affect performance (Clark 1973, Keller-Cohen 1975). Children and adults 
find it easier to deal with sentences in which the order of mention mirrors 
the actual order of events: sentences with before are easier than sentences 
with aCian. in fact the latter are often misinterpreted; generally befer^ 
is acquired earlier. 

2.3 Results 

Our main finding is that the children knew something about all three 
structures, but the degree of control differed. Interpretive complexity 
strongly affected the results, as predicted by the weaker, task-sensitive 
version of the complexity hypothesis. S structures weiv best controlled, 
Ving structures next, HP structures least. We discuss here the pattern of 
responses; the statistical treatment is presented in Appendix B. 

2.3.1 Correct Responses. 

As shown in the table below, the percentage of Correct responses 
was strikingly similar for S and Vlng sentences, a little over 50^ for the 
group as a whole. The MP structures received significantly fewer correct 
responses, 37>. (This is true for a Correct response category thtt includes 
both Correct and Simple Reverse responses. Otherwise there is a consistent 
difference between sentences with ]tft£&cft and after ^ the former receiving 
many more Correct responses than the latter.) 

U 
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9» Toy Moving J Responses by Structure (I of all 
Responses) 

Correct Single Error 







Clause 




s 


58 


22 


20 


Ving 


55 


22 


23 


NP 


37 


45 


18 



2.3.2 Single Clause 
Responses 

The pattern of Single Clause responses gives additional 
information. We take it that such responses indicate difficulty with the 
stimulus sentence, pace Hamburger and Crain 1982^^ . The children 
generally responded to the stimulus sentences with two actions, although 
all structures have the same presuppositions. The Single Clause response 
was a minimal response in this experiment; only rarely did the children 
refuse to act out anything at all. N? sentences tended to elicit many more 
Single Clause responses, especially from the younger children (see below 
for age-related data). This suggests that S and Vlng structures were about 
the same In difficulty, and that. NP structures were more difficult. In 
discussing the error types we will also argue that the S structures were 



^The work of Hamburger an»^ Craln with relative clauses suggests that 
pragmatic and age factors may make an important difference In children's 
responses to sentences Ulth embedded clauses Involving presuppositions. 

12 
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better controlled than the Vlng structures. 

The structures that elicited relatively laany Correct responses 
elicited few Single Clause reponses, and vice versa, as the table above 
indicates* 

There was a significant relation between age and success* An analysis 
of variance indicated that the oldest children gave a significantly greater 
number of correct responses than the two younger groups (see Appendix B for 
discussion of the statistical analysis). The oldest children also gave 
significantly fewer Single-Clause responses than the other two groups, ^he 
percentages of correct and SingXe-Clause responses are given in the tables 
below. 

10. Correct Responses 

(Correct category combines Correct * Reverse, as noted above) 





Group 111 


Group II 


Group I 




5;0-6;0 


M}6.5;0 


3j6.U;6 


S 


n 


38 


35 


Ving 


70 


36 


36 


MP 


56 


32 


20 



1v. Single Clause Responses 



S 8 43 27 

Ving 2 39 40 

NP 24 52 57 
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4»ount, another variable in the atimulus sentences, did not affect the 
re^onsea in a consistent manner. Shorter S and Ving structures, which 
contained intransitives, did not pattern with any particular type of 
response. The shortest sentences were KP structures, which were the Brv.«t 
difficult. 

2.3.2 We turn now to the category of Error responses. The most 
frequent errors differtd for each structural type, so the structural types 
were considered separately. I^ooking at the responses in this way, we found 
evidence that the children's knowlv^dge of the Ving and NP structures was 
less solid than their knowledge of the S structure. We develop this 
evidence below. 

S structure errors. To appreciate the error patterns it is useful to 
recall the aain characteristics of the S structure sentences. They have 
complete main and dependent clauses, involving three or four animals and 
two different actions. All the material of the S structure is overt: 
subjects must remember the nouns and verbs and act them out in the correct 
relation to each other. There is nothing that corresponds to the 
interpretation of one or more empty categories required for the other two 
structures. 

Most of the error responses correctly presented both actions talked 
about in the stimulus sentences; but one or more of the MPs was incorrect. 
Two types of errors predominated: Doubling, in which one animal does double 
duty, participating in two actions where the sentence mentioned two 
different animals; and Interchange, in which roles are changed, within or 
across clauses. Many responses had multiple errors? almost all errors 
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Involved th« animals rather than the actions. 

Doubling errors all have the effect of reducing the number of animals 
involved in the action: one animal is used for two roles. Four different 
Doubling strategies were identified: A) the Subject of Clause 1 is the same 
as the Subject of Clause 2, noted Subj 1 = Subj 2j B) Obj 1 « Subj 2| C) 
Obj 1 a Obj 2; D) Subj 1 » Obj 2. Examples follow, using one of the test 
sentences. 

12. Sentence: The cow chases the walrus before 
the horse tickles the tiger 
Response: Cow chases Walrus, Horse tickles Tiger 

Doubling Errors 

A: Cow chases Walrus, Cow tickles Tiger* 
B: Cow chases Walrus, Walrus tickles Riger 
C: Cow chases Tiger, Horse tickles Tiger* 
D: Cow chases Walrus, Horse tickles Cow 
*The structurally parallel NP might also be doubled 

The most frequent strategy by far was A, in which either Subjl or 
Subj2 is the agent of both actions. B is an Adjacency strategy : an MP 
that plays one role, an object, is given the adjacent subject role as well. 

The other frequent response type was Interchange: e.g. Subjl 
interchanged with Objl, Subjl with Subj2, etc. Finally there were Outside 
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errors (using an animal not mentioned in the sentence^ ^ )» and Verb errors. 

Multiple errors were ooded according to main error type^t* Doubling, 
Interchange, Verb, Outside* 

13. Error Responses to S structure 
% of all Error responses 
Doubling-types All Doubling - 64 < 

A • 38 % Interchange - 22 J 

- 9.5 * Out - 9.5 » 

C - 7 % Verb - 4.5 * 

D - 9.5 * 

There were few differences in the responses to sentences with 
transitive and intransitive clauses. It is particularly striking that the 
intransitives did not elicit many Outside responses. Of 38 error responses 
to intransitive sentences, only 5 were of the Outside type. This suggests 
that the Doubling and Interchange responses to the S stimuli were due to 
confusion about the MPs mentioned in the sentences and not the lack of 
other animals in the situation. 

We interpret these errors to mean that the children had reasonably 
good control of the S structure, but that the sentences tended to overload 
their memory. In particular it seems likely that the large number of NPs 
and Vs was difficult for the children to hold in memory; see the discussion 
below or the second imitation task, where it is shown that the presence of 



^These were possible for four of the six S sentences, those with one 
intransitive clause. 
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many nouns and verba in a santwee make it difficult to imitate. Mote that 
the task itaelf did not impose any constraints on which NP played which 
role. Indeed » in the various sentences all the animals are sometimes 
agents and sometimes patients. Since there was no particular reason for a 
giv^n animal to play one role rather than another/ and since memory 
capacity was taxed* the children tended to mix up the animals. 

The most frequent response, the A type, has one animal playing the 
agent role in both the main and adverbial clause. Such a response may be 
due partly to the toy-moving task itself. Several investigators have 
noticed that, in acting out multiple actions, children prefer to use a 
single toy as agent (Huttenlocher, Eisenberg, & Straus 1968, Legum 1975, 
Hamburger 1979). 

To complete the argument for this interpretation of the S error 
responses, we consider what a structure-violating response to these 
sentences would be like. Most drastic would be a single clause response; 
and indeed such responses were taken to indicate real difficulty with the 
structure. If the response has two actions, then a structure-violating 
error might make an intransitive clause transitive, or vice versa. There 
were very few such errors. Note thjst it would be rather implausible to say 
that the A-doubling responses reflect a different interpretation of S 
structures, namely as Ving structures or some other type of reduced clause. 
Such an explanation is implausible because it involves an extra load on 
memory. On the usual assumptions about memory, remembering a reduced 
structure is more costly than remembering an unreduced one: a 
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representation of both structure and reduction must be retained^ ^. The 
imitation data suggest that the children's memory spans were already taxed 
by the S sentences, and it is unlikely that a strategy that also burdens 
memory would be adopted* Morever, such an explanation cannot be extended 
to the other Doubling responses. 

We conclude, then, that the error pattern showed the children to have 
fairly good knowledge of the S structure, and some memory difficulties with 
the stimulus sentences. 

Ving structure errors. The characteristic errors to Ving structures 
were of two types. The Ving structu^ lacks an overt subject in the second 
clause; indeed, Ving sentences are''^ ungrammatical with an overt subject 
nallM J2fi£&Il£. hl^/l3&£X\s, leaving ). The c^econd clause subject is 
unambiguously taken to be the same as the subject of the main clause. So, 
to a sentenrte Ih& fiSK shSkSSS. horse before tickling sheep the 
correct response is one in which the cow chases the horse, and then the cow 
tickles the sheep. The children gave many responses in which the subject 
of the dependent clause was not the subject of the main clause: most 
frequent were an Adjacency response and and an Outside response. 

In the Adjacency response the NP object 1, adjacent to subject2 
position, is the actor for clause 2. So to the sentence The cow chases tfte 
hQi:s& J2fi£&Cfi tickling .^i^ .a):i£fia> an Adjacency response has the cow chasing 
the horse, and the horse tickling the sheep. This response corresponds to 



See the discussion of memory, in Daneman and Case 1981, for example; 
and Smith, 1970. 
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the Doubling Strategy B above In the Outside response, an aniaal not 
mentioned in the sent«inee^partieipates as subject or object of the second 
action. These two error types, in which the subjects of the two clauses 
are different, accounted for over 50% of the Error responses to Ving 
structures. 

The other Error responses preserved the equal subject interpretation 
b"t showed confusion elsewhere: the children used an animal not mentioned 
as subject, interchanged the animals mentioned, or confused or interchanged 
the verbs. The frequencies of the different responses ire shown in the 
table below. 

1M. Error Responses to Ving Structures 

% of Error Responses 
I: Subj1 s Subj 2 II: Subjl diff Subj2 



Out 21 * Adjacency 37 % 

Int 12 % Out 17 % 

Verb ^3 % 

We return to the interpretation of the error responses to S and Ving 
structures. One possibility is that the children had difficulty keeping 
straight the animals and roles in the Ving structures, as we have argued 



* Adjacency responses were scored as such if the NPs in question were 
adjacent in the stimulus sentence; or in the response (this second proviso 
applied to Reverse responses to sentences with after . 
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that th«y did for the S structures. In this event we would expect them to 
respond similarly to both structures, since acting them out required the 
same numbers of nouns and verbs* But the response patterns wer^ actually 
quite different, for same- subject and adjacency interpretations. The 
children responded to S structures with a same-subject Interpretation 
(Doubling strategy A) 38* of the time. For Ving structures this response, 
which is the correct one, occurred 55* of the time. The difference in use 
of the adjacency strategy is even more striking: it was used 37* of the 
time with Ving structures and 9.5$ of the time with S structures. 

The di.'ferent responses to S and Ving structures, then, are due to 
their differences in interpretive complexity. The Ving structure requires 
, interpretation; the S structure does not. The Adjacency strategy was 

essentially a response to the empty subject of the Ving structure; its 
frequency, together with the frequency of Outside responses, show that the 
children did not fully control the Ving structure. 

HP structure errors. The most frequent response to the" NP structure 
was the Single Clause; there were relatively few Error responses, and few 
Correct responses, as noted above. The few Error responses were 
Interchanges and actions using Outside animals. We take both as indicating 
poor understanding of the way the lone NP figures in the second clause. 

What is interesting is the almost total absence of errors involving 
the type of action of the second clause. There is only one such error. 
This shows that the children who attempted to act out the second clause 
understo:>d the basic structure of these sentences. Recall that there were 
many Single Clause responses to these structures. We suppose that those 
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children who did not understand the structure were those who acted out only 
one clause - recall that 45> of the responses to NP structures were Single 
Clause responses. Responses in the Error category, then» indicate some 
control of the structure in question. 

It is possible that responses to the MP structures were affected by 
the fact that they are ambiguous. This is an effect of the greatly reduced 
surface structure of such sentences, a feature of their relatively high 
interpretive complexity. We present in Appendix C the evidence for our 
subjects* awareness of ambiguity in the NP sentences. 

2.6 Conclusions: Toy-Moving 

The predictions of the weak complexity theory were borne out. 
The Correct and Single Clause response categories separated the NP 
structures as most difficult: there were fewer Correct responses, and many 
more Single Clause responses, to the NP sentences than to the other two. 
The Error responses showed that the Ving structure was somewhat less well 
understood than the S structure, In that many responses supplied an 
Incorrect subject for the dependent clause. In contrast, responses to the 
S did not Indicate consistent structural misunderstanding. 

The factor of Interpretive complexity predicts the apparently 
paradoxical order of control Indicated in this experiment. Sentences that 
are relatively long and explicit are controlled better than shorter, 
elliptical sentences. One might take difference In performance on the toy 
moving task to Indicate different degree of acquisition. We now discuss 
the Imitation experiments. 

21 
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3* Imitation 

Imitation tasks are based on the interaction between structure 
and memory. The basic strategy is to present for imitation sentences that 
overload a subject's memory to the degree where knowledge of structure 
makes a difference* It is important to find the critical length for 
subjects in imitation tasks. Short sentences can be imitated by rote, long 
sentences cannot be imitated even with structural knowledge. 

Imitation requires the subject to reproduce the surface structure of a 
sentence. It is thus quite different from the toy moving task. If 
linguistic complexity plays the same role in both tasks > that would be 
dramatic evidence favoring the strong complexity hypothesis. The weak 
complexity hypothesis would predict that factors of surface structure would 
be of paramount importance for this task. It is not dear how fully 
subjects interpret sentences presented in imitation tasks; errors are often 
semantic in nature, suggesting partial interpretation'® . The task in 
itself does not require that sentences be interpreted. 

Two imitation studies were done, focussing on the adverbial structures 
used in the toy-moving study. The first imitation study presented the same 
sentences used in the toy->moving experiment, defying the caveats above -the 
sentences varied in length. The second study controlled carefully for 
length. 



^^Smith 1970, Keller-Cohei. 1975 
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3*2. a Results 

Ungth of stimulus sentence predicted the distribution of correct 
responses. Correct responses were quite frequent to sentences of 8 words 
or less; they were quite infrequent to sentences of acre than 9 words. 
Since the NP structures are shorter than the others, this meant that the 
factor of length cverrode the other factors. The predictions of the weak 
complexity hypothesis were thus borne out. We present the percentage of 
correct responses, organized by subject groups, below. The oldest 
children. Group III, did very well with all three structures; variation 
occurred for the two younger groups. 

16. Percent Correct Responses to Imitation Study A 
Group III Group II Group I 

Structure 

S 80> 36lt 25% 

Ving m 333t 36* 

NP 92$ 71* 5Mf 



Although length alone determined success on the imitation task, the 
error data is suggestive for the S and Ving structures. The data, while 
sparse, indicates difference in the children's control of the two 
structures. The comparable responses were too few for statistical 
analysis, so our discussion will be informal. Error responses to NP 
sentences yielded little information. 

S-struotures: the great majority of error responses - S3. 5* were 
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difficult. This result is hardly surprising: length is known to affect 
imitation » and the stimulus sentences varied quite widely in length. 

3«3 Imitation study B 

In order to investigate more closely the imitation results and 
the effect of the task* a second imitation study was run. This study 
focused on the variable of structure and controlled for length; there was 
no variation in transitivity or connective. The intention was to explore 
as fully as possible children's imitation responses to the basic structures 
-full and reduced adverbial clauses- of the experiment. The sentences all 
had 9 words, transitive clauses, and the connective be fore t they differed 
in the structure of the adverbial clause. In order to keep length 
constant, we were forced to vary the nounphrases and verbs: nounphrases 
were complex [determiner + common noun] or single [proper noun]; verbs were 
complex [verb ♦ preposition] or single [verb]. They were presented with 
fillers to avoid predictability and boredom. For example: 

16. S: Michael caught the cat before Pete tickled the dog 

17. Vlng: The clown kissed the giraffe before petting the dog 
16. HP: The monkey waved to the oow before the elephant 

There were four exemplars of each structure; the test sentences appear 
in Appendix A. The sentences were presented in one session, interspersed 
with filler sentences of varying structure. There was at least one filler 
before each stimulus sentence, and three fillers before the first test 
sentence* 
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The subjects were 10 children » aged ^•5» Our earlier imitation study 
had determined that this age group was most responsive to the imitation 
task: Children older than 5 generally imitated most or all of the sentences 
correctly, children younger than 4 tended to be unable to imitate any of 
them correctly. 

3.3*1 Results 

The results of this experiment were surprising: they supported 
the previous imitation study, even though length was held constant. The NP 
sentence were easiest, Ving sentences less easy, S sentences most 
difficult. 

20. - $ Correct Imitation 
Responses 

S Ving HP 

25* 45* 75* 

The children who were less advanced linguistically tended to have more 
difficulty with the S and Ving sentences, but there was little significant 
variation In the group as a whole. 

3.3.2 Discussion 

Since length was held constant we must look to some other factor to 
explain this rather surprising result. Continuing with the hypothesis that 
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iBltfttlon tasks focus on surface structure » w« consider carefully the 
surface structures of the test sentences. 

The sentsnees differ in the number of surface nouns and verbs that 
th«y contain, and this difference patterns with the imitation results. The 
sentences that were easier to imitate have fewer nouns and verbs than the 
others: 

22. - Nouns & Verbs in Test Structures 

S: Michael caught the eat before Pete tickled the dog (6) 
Ving: Michael caught the oat before tickling the dog (5) 
NP: Michael caught the cat before the dog (U) 

Since the test sentences of each structure are entirely consistent 
with regard to length, this factor differentiates between the structures. 
S structures have 4 nouns and 2 verbs; Ving structures have 3 nouns and 2 
verbs, S structures have U nouns and 2 verbs. 

We can only speculate as to the steps required in uttering nouns and 
verbs, especially in comparison to those required for determiners and 
prepositions (and perhaps other function words). Nouns and verbs have more 
semantic content, in some sense, than do function words: the latter are 
partially automatic and/or partially redundant < In order to explore this 
question more fully one would have to understand the processes tapped by 
the task of imitation. 



^Earlier work with imitation tasks has also suggested that high-content 
words make a difference, cf Smith 1970. 
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But the task of elioited loltatlon, straightforward as It oiay seeot Is 
poorly understood. We know that syntactic structure makes a difference, 
and that subjects eonprehend the stimulus sentences to some degree; but it 
is not clear what level or kind of processing is involved, t* see this, 
consider the typical error responses to imitation tasks. 

Semantic substitution *»rrors are very common: subjects tend to change 
articles -e.g. ihfi. for a» t-^imes - e.g. John or the bov for Jim , even verbs 
-e.g.gft for run. See Smith 1970 for discussion and examples of such 
errors. These would be serious errors in the world, or in certain tasks 
involving interaction with a toy world. But for the imitation task they 
seem trivial. Typically in this task a subject hears a succession of 
sentences, one after the other j no response other than imitation is called 
for. In particular there is no requirement that a subject relate the 
stimulus sentences to anything in a real or imagined world. In this task 
it really doesn't matter whether a sentence mentions John or Jim. There is 
nothing about the task, or subjects' responses, suggesting more processing 
than that of short-term memory. 

Indeed, these considerations support the general view that an 
imitation task mainly taps short-term memory. The fact that subjects 
indicate some degree of comprehension is not surprising: studies of word 
recognition suggest that people comprehend automatically words presented to 

OA 

them*^^ . Returning to the results of our experiment, it seems that high 
semantic content demands more -units? time? we do not know- of short- 



For example, Forster 1970 
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memory than does lower semantic content. Syntax is not the only factor of 
surface structure that affects short-term memory, and therefore not the 
only factor that affects success in an imitation task. Concerning the 
difference between linguistic recognition and production, Daneman & Case 
1981 suggest that the latter is more difficult because it requires 
attention to all the features of a sentence, whereas the former does not. 
••Semantic and syntactic features must be dealt with as independent units 
for production.. .these features needn't be treated as independent units for 
recognition. »• Daneman & Case claim that production, and not comprehension, 
requires that each part of an uttt^rance be processed sequentially and 
placed briefly in short term memory. 

From this point of view the results of the imitation experiment are 
consonant with the other results and predictions of the weak complexity 
hypothesis. That hypothesis denies a simple one-to-one relation between 
linguistic complexity and performance: rather, the linguistic factors 
affecting performance are cl^twett to be task-dependent. We have shown that 
the task of imitation involves at least three factors of surface structure, 
length, syntax, and number of nouns and verbs. 

4 . Conclusions 

We conclude that the weak complexity hypothesis Is correct and the 
strong complexity hypothesis Incorrect. The relation between linguistic 
complexity and performance depends on the requirements of performance, 
which vary widely. This Is hardly a surprising result, but it Is a»i 
Important one to document. 

The results of our two experiments are, of course, very different r so 
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different that the relation between the tasks is called into question. It 
seems clear that different abilities are tapped by the two tasks used, and 
that success in one does not necessarily predict success in the other^' 
This fact is of some interest for the field of language development. It 
means that the results of elicited imitation cannot be taken as more than 
partial evidence of children's language knowledge. Tasks of comprehension 
and elicited imitation may sometimes have similar patterns of results (as 
in Lust, Solan, Flynn, Cross, & Shuetz 1981, for example); it is plausible 
to assume that such similiarity indicates good knowledge of the material 
tested. But the patterns of results may differ, as shown in the 
experiments reported here. This shows that different tasks are needed for 
assessing children's linguistic competence as a good deal more than checks 
or safety measures. They tap different abilities and are therefore 
complementary in a very basic sense. 

The modular theory of complexity has been useful, indeed essential, in 
construing these results. The identification of separate factors in 
linguistic complexity has allowed us to vary them and to discover some 
interesting interactions of complexity and performance. The theory with 
which we have been working has proved useful enough to merit further study. 
Our lesults suggest that other structures involving empty categories might 
usefully be investigated in this framework. Of direct relevance are other 
reduced structures, such as empty categories keyed to particular verbs; 
such structures involve systematic complexity and interpretive complexity. 
Also relevant are other types of interpret ive.ly complex sentences involving 
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Similar r*J3ults are reported in Keller-Cohen 1975, (p. 107), 
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quantlficfttlon, complex tenporal 
generally, the three types of 
Investigation as they affect the 
adults. 



reference, and tdverbials. And more 
linguistic complexity need further 
linguistic performance of children and 
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Appendix A List of stimulus sentences 
Toy-Mov:.ng and Imitation A 

These sentences are in blocs according to structure rather than in the 
order of presentation. 

1. The cow chases the horse before the tiger sleeps. 

2. The horse sleeps before the walrus kisses the cow. 

3. The cow kisses the tiger before the horse chases the walrus. 
The horse tickles the walrus after the tiger kisses the cow. 

5. The tiger jumps up after the horse tickles the cow. 

6. The horse ^tisses the tiger after the walrus jumps up. 

7. The horse aleeps when the cow tickles the walrus. 

8. The walrus sleeps when the tiger chases the cow. 

9. The tiger jumps up before chasing the cow. 

10. The cow tickles the walrus before sleeping. 

11. The walrus tickles the horse before kissing the cow. 

12. The horse chases the tiger after jumping up. 

13. The tiger chases the horse after tickling the cow. 
U. The tiger jumps up after kissing the horse. 

15. The tiger chases the horse before the walrus. 

16. The walrus tickles the tiger before the horse. 

17. The walrus chases the cow after the horse. 

18. The cow kisses the walrus after the tiger. 

Imitation B (without fillers) 

1. The horse chased Tommy before the lion pushed Susan. 

2. Harry patted the dog before Mary kissed the cat. 

3. Michael caught the cat before Pete tickled the dog. 

4. Robert fed the chickens before Janie rode the donkey. 

5. The horse kicked the dog before chasing the cow. 

6. The tiger patted the lamb before riding the horse. 

7. The lion pushed the tiger before chasing the zebra. 

8. The clown kissed the giraffe before patting the dog. 

9. The elephant knocked down the giraffe before the camel. 

10. The girl talked to the lady before the boy. 

11. The snake looked at the turkey before the rabbit. 

12. The monkey waved to the cow before the elephant. 

Appendix B Statistical analysis of Toy-Moving results 

The children acted out a total of 18 sentences. Two were filler 
sentences with iibsa as connective. The S and Ving structures were tasted a 
total of six times each, three times with before and three times with 
a£tsn. The HP structure was also tested with both connectives: there were 
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two sentences with be fere and two with aftar . 

In order to statistically determine whether the children responded 
similarly to sentences of the same structure, a Kuder-Richardson (KR-20) 
reliability procedure was used (this procedure is appropriate for 
dichotomous variables). The program Reliability-Model Alpha in the 
Statistical Package for the Social Sciences was used (Hull and Nie, 1981). 
We combined responses to the two connectives since they did not differ. As 
noted above, all sentences tended to be acted out in the order of mention, 
and this meant that the before sentences were correctly acted out whereas 
the aft^r. sentences were acted out in reverse order. 

In using this procedure we created a correct category (including 
correct and reversed clause order responses) and an incorrect category 
(which included the various error responses). The standard alpha levels 
obtained for the structures having six test sentences exemplars (S and 
Ving) were .73 and .7**, respectively. We considered this acceptable for a 
six item subtest. Furthermore, in all cases except one the analyses 
indicated that the alpha levels were not improved by subsequent elimination 
of test sentences. Sentence 13, an S aftar sentence, was the exception; it 
was eliminated from subsequent analyses since the reliability procedure 
indicated that the children treated it differently from the other two S 
after sentences. 

The alpha level for the NP structures, where there were only four 
sentences, was .62, This was achieved by dropping sentence 2, an NP after 
sentence. Again, further elimination of sentences did not improve internal 
reliability. Using only the reliable items (that is, eliminating sentences 
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2 and 13), we were able to assume that responses were sufficiently 
homogeneous to allow deriving a single overall index of performance by 
using the composite score of the percentage of correct responses to each 
structural type. These scores were entered into tn« analyses of variance, 
discussed below. 

Two analyses of variance were conducted using a two factor (3x3) 
design with repeated measures on one f?ctor (using the biomed program 
BMDP2V»Analysis of variance and covariance with repeated measures). The 
factors were age (with 3 levels), and structure type (also with 3 levels). 
The structural type was the repeated factor, since each child acted out all 
structural types of sentences. 

The first analysis determined the effect of age and structural type on 
the total number of sentences acted out correctly (recall that order of 
mention, notionally an order reversal, was considered a correct response). 
The results indicated a significant main effect for the age factor ^Fs9.82, 
p «,0003), and a significant main effect for type of structure (F=6.20, 
p. s. 0003). The interaction effect between age and structural type was not 
significant. A follow-up multiple comparison on the age factor showed that 
the oldest children performed significantly better than the two younger age 
groups, and that the differences between the middle and youngest group was 
not significant. For structure, the results on the NP structure were 
significantly poorer than for the S and Ving structure; S and Ving were not 
significantly different. 

The second analysis considered single clause error responses. These 
were the only erro* responses numerous enough to subject to statistical 
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analysis. Internal consistency was determined, using the KR-20 reliability 
procedure. The alpha levels obtained were .80 for the S structures, .71 
for the Ving structures, and .76 for the NP structures. These levels 
ladieated that the children treated sentences of each structural type with 
sufficient similarity to allow conducting an analysis of variance on the 
data. Again, the age factor was significant (F s 8.62; p=.OO07) Follow-up 
comparison showed that the youngest children's performance was 
significantly poorer than the middle and oldest groups; the difference 
between the latter two was not significant. Structural type was also 
significant (F « 19.91; p s.OOOO). The children's performance was 
significantly poorer with the NP structures than with the S and Ving 
structures; performance on vhe latter two did not differ significantly. 
Appendix C: Ambiguity in NP Structures 

The NP sentences have two interpretations, one with the adverbial NP 
as subject and one with the NP as object of the adverbial clause. For 
instance, the adverbial clause in liacfia iimfil ^ SihS^SL before t^he 
Sim may have the horse tickling the cow; or, the cow tickling the sheep. 
Although there is some evidence in the literature thav children of this age 
are unaware of syntactic ambiguity (Kessel, 1970), the children s responses 
suggest that they were uncertain and aware that more than one 
interpretation was appropriate. 

To investigate the question of whether the NP sentences were ambiguous 
for our subjects, we look at how the lone NP was interpreted and whether 
the children as a group were consistent in their interpretations. The MPs 
were interpreted as both Subject and Object. The older children slightly 
favored the Object interpretation and the younger children strongly favored 
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the Subject interpretation. 



NP Subject and Object Interpretations 



% of Correct Responses by Age Group 



Subject 



Object 



Group III 



60 



Group II 



29 



Group I 



66 2/3 



33 1/3 



Finally, we ask whether individual subjects consistently gave a single 
interpretation of the lone NP; if most children were consistent, that would 
be evidence that they were unaware that the NP sentences had another 
interpretation. Of the 44 subjects, 36 were not consistent: that is, they 
interpreted the lone NP as both Subject and Object. This inconsistency 
suggests that the children were aware of both interpretations of the NP 
structure. (Of the 8 who were co'isistent, 5 treated the NP as Subject and 3 
treated it as Object; the 8 did not belong bo one particular age group.) 
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